SYMPLECTIC EMBEDDINGS AND CONTINUED 
FRACTIONS: A SURVEY 



DUSA MCDUFF 

Abstract. As has been known since the time of Gromov's Nonsqueez- 
ing Theorem, symplectic embedding questions lie at the heart of sym- 
plectic geometry. After surveying some of the most important ways of 
measuring the size of a symplectic set, these notes discuss some recent 
developments concerning the question of when a 4-dimensional ellipsoid 
can be symplectically embedded in a ball. This problem turns out to 
have unexpected relations to the properties of continued fractions and 
of exceptional curves in blow ups of the complex projective plane. It is 
also related to questions of lattice packing of planar triangles. 



1. Overview. 

The standard symplectic structure on M. 2n is: 

ujq = dx\ A dx2 + dxs A dx^ + • • • + dx<in-\ A dx2 n - 

By Darboux's theorem every symplectic form is locally diffeomorphic to 
this one, so it is crucial to understand its properties. 

Let B := B 2n (a) C M 2 ™ be the standard (closed) ball of radius y/a (so a is 

proportional to a 2-dimensional area), and let <f> '■ B ^ 2n be a symplectic 
embedding (i.e. a smooth embedding onto 4>(B) that preserves loq). Since 
lJq = n! dx\ A • • • A dx2 n is a volume form, every symplectic embedding 
preserves volume. 

Gromov's question: What can one say about the images 4>(B) of a sym- 
plectic ball? How are the symplectic and volume preserving cases different? 

The following result can be easily proven using Moser's homotopy method. 
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Volume preserving embeddings: if V C K is diffeomorphic to a ball 
and vo\V = volB then there is a volume preserving diffeomorphism ip : 

The analogous statement is not true for symplectic diffeomorphisms — called 
symplectomorphisms for short — since the boundary of V has symplectic 
invariants given by the characteristic foliationj^] Nevertheless one might ask 
if one can fully fill V by a symplectic ball. This means that for every 
£ > there is a ball B and a symplectic embedding <p : B^V such that 
vol(V\cp(B)) < e. 

Let Z{A) be the cylinder 

B 2 {A) x R 2n - 2 = {(xi, x 2n ) G M 2n : x\ + x\ < A} 

We provide Z{A) with the symplectic structure u>q. Thus its first two co- 
ordinates lie in symplectically paired directions. In particular, loq has non 
zero integral over each disc B 2 (A) x {pi}. 

Gromov's Nonsqueezing Theorem: There is a symplectic embedding 

<t> : B 2n (a)^Z(A) 

if and only if a < A. 




Figure 1.1. Does the ball embed symplectically in the cylinder? 

This nonsqueezing property is fundamental. To a first approximation, 
it is true that a diffeomorphism with this nonsqueezing property for all 
balls and all symplectic cylinders must preserve the symplectic structure. 
(More precise versions of this statement were proven by Eliashberg |8j and 
Ekeland-Hofer [7].) 

It follows also that symplectomorphisms are very different from volume 
preserving embeddings. For example, the largest ball that embeds symplec- 
tically in the polydisc B 2 (l) x B 2 (l) is B A (1) which has volume just half 
that of the polydisc. 



The characteristic foliation on a smooth hypersurface Q is spanned by the unique 
null direction of ui\q, i.e. by the vectors v G TQ such that w(v, w) — for all w G TQ. 
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1.1. Symplectic capacities. In their paper [7], Ekeland and Hofer formal- 
ized the idea of a symplectic capacity, which is a measurement c(U, to) of the 
size of a symplectic manifold (U,co) (possibly open or with boundary) with 
the following properties: 

(i) c is a function with values in [0, oo] defined on some class of 2n- dimensional 
symplectic manifold^ (M, u>) that contains the closures of all open subsets 
of Euclidean space; 

(ii) (monotonicity) if there is a symplectic embedding (U,oj) — > (U' \oo') then 
c(U,lo) < c{U',u'); 

(hi) (scaling) c(U, Xu) = Xc(U, uj) for all A > 0; 

(iv) (normalization) c[B 2n (a)) > and c(Z 2n (a)) < oo. 

The monotonicity property implies that c is a symplectic invariant, that is, 
it takes the same value on symplectomorphic sets, while the scaling property 
implies that it scales like a 2-dimensional invariant. One can satisfy the first 
three properties by considering an appropriate power of the volume; for 
example one could consider the function c(U,uj) := (/^ u) n ) l ^ n . However, 
this function c does not satisfy the second half of the normalization axiom. 
Indeed, the requirement that a cylinder has finite capacity is what makes 
this an interesting definition. 

Here are some examples that illustrate some of the variety of possible 
definitions. 

(i) The Gromov width. c g (U,uj) = sup{na\ B 2n (a)^(U,uj)}. 

The nonsqueezing theorem implies that this does satisfy the axioms. 
Moreover, it satisfies the following strong normalization condition: 

c{B 2n (a))=c(Z 2n (a)). 

(ii) The Hofer Zehnder capacity [lTj. Every smooth function H : M — » 
R on a closed symplectic manifold generates a 1-parameter subgroup t S 
M, of the group of symplectomorphisms of (M, uj) by the following procedure; 
define the vector field Xh, the symplectic gradient of H, by requiring that 

uo(X H ,-)=dH(-), 

and then define §f to be the flow generated by Xu with <pQ = id. The 
identity 

Cx H {to) = d{i{X H )uj)) + i{X H )duj = d{dH) = 0, 

(where C denotes the Lie derivative) shows that (cj)^)*uj = uj for all t. In 
other words, the flow preserves the symplectic form. A point x for which 
there are times < t < T such that 4>^(x) = x but 4>^{x) ^ x is said to be 

2 A smooth manifold M is said to be symplectic if it is provided with a symplectic form 
uj. Here uj is any closed 2-form such that uj" never vanishes. A fundamental result due 
to Darboux is that every symplectic form is locally diffeomorphic to Euclidean space R 2n 
with its standard structure ujq. Hence the importance of understanding the symplectic 
properties of subsets of R 2n . 
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a nontrivial periodic orbit of <f>^ of period T. Further we define Ti to be the 
set of functions H : M — > R with the following properties: 

• H(x) > for all x G M, 

• there is an open subset of M on which H = 0; 

• H is constant outside a compact subset of the interior of M; 

• every nontrivial periodic orbit of H has period T > 1. 

Then we define the Hofer-Zehnder capacity as follows: 



This function chz obviously satisfies the first three conditions for capacities. 
Moreover as A increases the flow of XH moves faster so that the periods of 
the periodic orbits decrease. Therefore in order to prove that a set such as 
B 2n has finite capacity one needs a mechanism to prove that periodic orbits 
for <j)^ must exist under suitable circumstances (for example, if H = XK 
where K satisfies the first three conditions to be in Ti and A is sufficiently 
large.) In the original papers this mechanism involved subtle arguments 
in variational analysis; however one can also prove such results using J- 
holomorphic methods. 

(iii) The displacement energy. If Ht ■ M — ► M., t G [0, 1] , is a smooth fam- 
ily of functions then one can define a path of vector fields Xjj t by requiring 
that 



and then integrate this family of vector fields to get a flow <pt ,t 6 [0,1]. 
As before, each diffeomorphism preserves u>, and the family is called a 
Hamiltonian path, or Hamiltonian isotopy. Define the length of 
such a path by setting 



(Thus this is a measure of the total variation of Ht.) 

If U C (M, uj) then a Hamiltonian isotopy of M is said to disjoin U 
in M if <pi(U) H U = 0. The displacement energy of a subset (U,uj) of a 
symplectic manifold (M, u) is defined to be: 



where we take the infimum of an empty set of real numbers to be oo. It is a 
deep fact that the displacement energy of a ball is always positive, no matter 
what M is. Using this, it is easy to see that this energy defines a capacity 
on the set of all subsets of a given symplectic manifold (M, uj). Interestingly 
enough, as explained in [TH] there are relations between the displacement 
energy of a ball and the nonsqueezing theorem. 

Although the displacement energy of a set (U, uj) in principle depends on 
the choice of ambient manifold (M,uj), I am not aware of any examples of 




u(X Ht , •) = d(H t )(-) for each t 6 [0, 1] 




d M (U,uj) 



inf{£({0f }) : disjoins U in M} 
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sets U that have different displacement energies when considered as subsets 
of two different manifolds (Mi,uj±) and (M2,W2), except in the trivial case 
when U is displaceable in M\ but not in M2. As a refinement of this question 
one could consider two different symplectic embeddings i\ and 12 of one set 
(U,uj) into another (M,uj). Are there any examples where cIm(i>i(U, uj)) / 
dM(t-2(U,cu))? This will not happen if the two embeddings are Hamiltonian 
isotopic (i.e. there is a Hamiltonian isotopy 4>f* of M such that l% = 4>i o 
Li). But there are cases in which nonisotopic embeddings are known to 

exist; in other words, the space of embeddings (U, uj)^(M, uj) need not be 
connected. Such examples (of embeddings of one polydisc B 2 {a\) x B 2 (a2) 
into another) were constructed in [3]. This leads into the whole question 
of what is the topology of such embedding spaces. Rather little is known 
about such questions. 

The above three capacities are by now classical invariants. There are 
many other more recently introduced functions that share some of the prop- 
erties of capacities. For example, Biran and Cornea [3, §6,6] are interested in 
exploring the "size" of Lagrangian submanifolds L of (M, uj). A Lagrangian 
submanifold is a submanifold of dimension n on which the symplectic form 
vanishes identically. A basic example is the subspace Vr of (M 2n , ujq) spanned 
by the coordinates xx,xs, . . . , x^n—i- Lagrangian submanifolds are funda- 
mental objects of study in symplectic geometry. 

Let us write B^ 1 for the intersection B 2n {a) n Vr. Then the width w(L) 
of a Lagrangian submanifold L C (M, uj) is defined to be the supremum of 
ira for which there is a symplectic embedding 

<A:CB 2 >),<(a))A(M,L) 

such that = -B| n (a)- Although some calculations can be made, it 

is not even known whether every compact Lagrangian submanifold of M? n 
must have finite width. 

Because so little is known about these general embedding questions, we 
shall now return to considering balls and ellipsoids. As we shall see, their 
embedding properties are very closely related, at least in dimension 4. We 
shall first consider the problem of embedding several equal balls, and then 
discuss ellipsoids in ^2] 

1.2. Symplectic ball packing. Question: How much of the volume of 
B(l) can be filled by k equal symplectic balls? 

[Gromov [1 1 J ] : there are obstructions when k = 2: 
if B(a)UB(a)^B(l) then a < 1/2; 

in other words, the volume of the image of B(a) U B(a) is < ^xvol (B(l)). 

[McDuff and Polterovich |22j: for any d > 1 the 2n- dimensional ball B{1) 
can be fully filled by k = d n equal balls. 
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One can explicitly see these embeddings in the following way. 

Describing embeddings by toric models: 

Consider the (moment) map <& : C 2 — > M. 2 , <&(zi, z%) = {\zi\ 2 , \ z 2\ 2 )- 
Then $(5(1)) is the (standard) triangle {0 < x\,Xi\ x\ + xi < l}. Thus 
the inverse image of the closed triangle can obviously be fully filled by a 
ball. Somewhat surprisingly, Traynor [27] showed that it is possible to fully 
fill the inverse image of the interior of any standard triangle. Hence, we can 
obtain full fillings of -B 4 (l) by d 2 equal balls by cutting a standard triangle 



into d 2 standard pieces as in Figure 1.2 




Figure 1.2. Embedding d 2 standard triangles into a triangle 
of size 1. 



However, when k ^ d 2 the story is different: 

[McDuff-Polterovich [22 ]: In dimension 4 there are obstructions to full fill- 
ings by k balls when k < 9, k ^ 1, 4; 

[Biran [T]] B A (1) can be fully filled by k equal balls for all k > 9. 

Thus, with enough balls, the obstructions disappear. In later work [2], 
Biran showed that for all (closed) rationaj^] symplectic 4-manifolds (M, uS) 
there is an integer N = N(M,uj) such that (M, uS) can be fully filled by k 
balls for all k > N. This is rather surprising. Note that N depends on uj 
and can diverge to infinity even when M is fixed. For example, in the case 
when M = S 2 x S 2 and u>k = \p r l°~ x P r 2 a f° r some area form a on S 2 of 
total area tt, the nonsqueezing theorem shows that one cannot embed a ball 
larger than i? 4 (|). Since the volume of (M, u>k) is j- while that of -B 4 (j:) 

2 

is we find that N{M,u)k) > 2/c. Thus, it is not at all clear whether 
one can remove the rationality condition on [u] . It is also not clear whether 
there is an analogous result in higher dimensions. 



2. Embedding ellipsoids into balls 



s 



From now on we shall work in 4 (real) dimensions, and shall write E^B 
to mean that there is a symplectic embedding of E into B. Define the 
ellipsoid E(a, 1) by setting: 

2 2 

E(a,l) ={(x 1 ,...,x 4 ) £l 4 : X ^^+x 2 + x 2 < 1}. 



3 (M,w) is called rational if [a] e H 2 (M;Q). 
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Consider the embedding capacity function c for a > 1: 

c(a) := inf{/i : E(a, 1)^%)}. 
Note that c(a) > because vol£'(a, 1) = voli?(- v /a). 

In [5], Cieliebak, Hofer, Latschev and Schlenk describe a wide variety 
of symplectic embedding problems in quantitative terms, formulating many 
interesting questions, but providing relatively few answers. (See also Schlenk 
[26].) In particular, they define the function c(a) described above, but could 
say rather little about its properties. The first most significant open question 
concerned the value of c(4). This was in fact first calculated by Opshtein 
|24j . though he did not make this explicit in his paper. 

As we shall see, the function c(a) turns out to be surprisingly interesting. 
It is the first of the capacity functions of |5; to be calculated]^] Note also 
that although the nonsqueezing theorem stated above is valid in all dimen- 
sions, we only understand the embedding of ellipsoids in 4-dimensions. As 
is shown by the work of Guth [12] and Hind-Kerman |15j . very interesting 
new phenomena appear in higher dimensions. 

3 



2 



1 



123456789 

Figure 2.1. The approximate graph of c(a). Here b := 

Theorem 2.1 (McDuff-Schlenk) . Let r = The graph of c(a) divides 

into three parts: 

• ifl<a<r 4 the graph is piecewise linear - an infinite Fibonacci staircase 
converging to (r 4 ,r 2 ); 

• t 4 < a < is a transitional region; c(a) = \fa except on a finite number 
of short intervals; 

• if a > 8^ = (^-) 2 then c(a) = ^fa. 

4 In fact, the proofs of the results stated below are not yet quite complete, so that at 
this stage they should still be considered conjectural. 
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Description of the Fibonacci stairs: Let 

51 = 1, 52 = 2, 53 = 5, 54 = 13, 55 = 34, 5 6 = 89,..., 

be the odd terms in the sequence of Fibonacci numbers; set 

in : =(5n+i/5n) 2 , b n := g n +2/9n 

so that a n < b n < a n+ i, and a n — ► r 4 . Here we set 50 := 1 for convenience, 
so that ao = 1, bo = 2. Then the claim is that for all n > we have 

c(x) = x/^/a^ on [a n , b n ], and c{x) = ^a n+ i on [b n , a n+ i]. 
Since -fe= = A /a n _i_i this gives a continuous graph on the interval 1 < a < r 4 . 

V ^ V 1 

For example, 

c(2) = c(6o) = v^l = 2 = c(4), c(5) = c(h) = \ = c(6|). 

Note that the function c(a) /a is continuous and nonincreasing. (This holds 
because, for all A > 1 and e > 0, we have E(\a, 1) C E(Xa, A) = XE(a, 1) C 
\B(c{a) + e).) It follows that the function c is determined on the interval 
[1, t 4 ] by its values at the points a n ,b n . 

As we describe in more detail below, the first and third parts of Theo- 
rem 2.1 are fully proven, while the proof of the second statement is almost 



complete. The proofs are based on the two Propositions |2.2| and 2.4 stated 
below. The first relates the ellipsoidal embedding problem to a ball packing 
problem, while the second gives a recipe for solving ball packing problems. 

Proposition 2.2 (McDuff [21]). For each rational a > 1 there is a finite 
weight expansion w(a) = (wx, . . . , wm) such that E(a,l) embeds symplec- 
tically in the interior of B 4 (fi) if and only if the disjoint union of balls 
[J B(wi) embeds symplectically in the interior of B 4 (fi). 







2 2 2 N 
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Figure 2.2. w(25/9) = (1, 1, 7/9, 2/9, 2/9, 2/9, 1/9, 1/9) 
which we abbreviate as (l x2 , 7/9, (2/9) x3 , (1/9) x2 ). The 
multiplicities 2, 1, 3, 2 of the weights are the terms (or partial 
quotients) of the continued fraction representation [2; 1,3, 2] 
of 25/9. 



Weight expansions are perhaps best explained pictorially as in Figure 
but here is a formal definition. 



2.2 
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Definition 2.3. Let a = p/q E Q written in lowest terms. The weight 
expansion w := (wi) := (wi, . . .Wk) of a > 1 is defined recursively as 
follows: 

• w\ = 1, and w n > w n+ i > for all n; 

• if Wi > Wi + i = ■ ■ ■ = w n (where we set wo := a) then 

_ ( w n if Wi+i H h w n +i = (n — i + l)m+i < Wi 

n+1 \ Wi — (n — i)wi + \ otherwise; 

• the sequence stops at w n if the above formula gives io n +l = 0. 

The number k of nonzero entries in w(a) is called the length 1(a) of a. 

The second main proposition solves the ball packing problem. Given k 
write m := (mi, . . . , m/%) £ N fc , > mj+i. Let 

£ := £fc : = {(d; m) : d 2 + 1 = mf , 3d— l=^^mj, and (*)}. 

i i 
We shall refer to the first two conditions above as Diophantine conditions. 
The third condition (*) is more algebraic, requiring that tuple (d; m) can 
be reduced to (0; 1) by repeated Cremona moves. Such a move takes (d, m) 
to (d';m'), where (d';m') is obtained by first transforming (d; m) to 

(2d — mi — rri2 — m^; d — m,2 — ms, d — rrt\ — m.3, d — mi — 7712, 771,4, ir>>5 ■ ■ ■), 

and then reordering the new mi (discarding zeros) so that they do not in- 
crease. 

It was shown in Li-Li [20] (using Seiberg-Witten theory) that is the 
set of homology classes dL — ^ miEi represented by symplectic exceptional 
divisors in the fc-fold blow up of CP 2 j^] It follows that the intersection num- 
ber (d; m) • (d'; m') := dd' — ^ 777^777^ of any two elements in £ is nonnegative. 

Proposition 2.4 (McDuff-Polterovich (22], Biran (I|). 

U B{wi)^->B(p) <^=^ a < /i 2 and fid > miWi V(<i; m) G £^. 

i<k — ' 

Corollary 2.5. c(a) = sup| y/a, sup C m ^»W I 

Example: £4 has the single element (1; 1, 1) (corresponding to L — E\ — E2) 
and w(4) = (1, 1, 1, 1) = (l x4 ). Therefore c(4) = 2. However £5 also 
contains (2; 1, . . . , 1) = (2; l x5 ) corresponding to 2L — Yli=i Thus c(5) = 
sup{ v / 5,2,5/2} = 5/2. 

Here we denote by L the class of a line [CP 1 ] and by Ei the class of the ith exceptional 
divisor. For more about blow ups see §2.2| Observe also that this is a place where 
symplectic geometry shows how flexible it is in comparison with algebraic geometry. In 
algebraic (or complex) geometry, one would want to describe the homology classes that can 
be represented by holomorphically embedded exceptional divisors for a "generic" complex 
structure on the blow up. This question is far from being understood. 
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The existence of the Fibonacci staircase in Theorem 12.11 is based on the 
somewhat surprising discovery that there are elements of £ related to the 
weight expansions of the odd Fibonacci numbers. 

Proposition 2.6 (McDuff-Schlenk [23]). Denote the odd Fibonacci numbers 
by g n ,n > 1, as above, and define a n := (g n +i/g n ) 2 and b n := g n+2 / 1 g n - 
Then: 

(i) E{b n ) := (flW+i; 5nW(6 n )) G £. 

(ii) E(a n ) := (g n g n+1 ; gl(w(a n )),l) G £. 

Idea of proof. It is not hard to show that the elements E{a n ) and E{b n ) 
satisfy the Diophantine conditions needed to be in £. Moreover, there is 
an easy inductive proof that E{b n ) satisfies the third condition since E(b n ) 
reduces to E{b n -2) under five Cremona moves. However, the corresponding 
behavior of E{a n ) is much more complicated and takes much more effort to 
analyze. □ 

Example: n = 2 gives: (5;2w(13/2)) = (5;2 x6 , l x2 ), while 

5L - 2(E X + ■ ■ ■ + E 6 ) - E 7 - E 8 G £ 8 . 
Similarly, n = 4 gives 

(34; 13w(89/13)) = (34; 13 x6 , 11, 2 x5 , l x2 ), 
and one can check that 

34L - 13(£i + • • • + E 6 ) - 11E 7 - 2(E 8 + ■■■ + E 12 ) - E 13 - E u G £ . 



Corollary 2.7. For all n > 1, c(a n ) = yj~a^ and c(b n ) = ^a n +i = c(a n+ i). 
Proof. Since the intersection of any two elements of £ is nonnegative, Propo- 



sition 



2.6 part (ii) implies that for any (d; m) G £ we have 



(d; m) • E(a n ) = dg n g n+1 - Y%n miWi(a n ) - m k+1 > 0, 



where k := l{a n ). Thus 

m-w(a n ) _ Yy m i w i( a n) < dg n g n+ i 



d d dg l n "" 

It follows that c(o n ) = i/a^. 

On the other hand, part (i) of the proposition implies that 

miWi(b n ) => gn+i/J- > g n ^ Wj{b 

n) — grfin 

=> v > (g n g n +2) I '(gn+Wn) = V a «+i- 

This shows c{b n ) > ^/a n +i- Since b n < a n+ \ we must have c(b n ) < c{a n+ i) = 
V a n+i- Thus c(b n ) = c(o n+ i). □ 
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Claim I: c(a) = on the interval [t a ,7], with obstruction given by the 
class (d;m) = (3,2,l x6 ) G £ 7 . 

So far this has been proved for a > 6^ an d for the (even) convergents 
to t. The proof is quite hard because there are "fake" elements of £. More 
precisely, by looking at ratios of even Fibonacci numbers one can construct 
tuples (d; m) that satisfy the Diophantine conditions to be in £ but that fail 
condition (*); in fact they have negative intersection with (3,2, l x6 ), that is 

3d < 2m\ + rri2 + • • • + m-j. 

There are also infinitely many other elements of £ , this time constructed 
from the even Fibonacci numbers, for which (]T] miWi(a))/d = on some 
interval [t 4 ,t 4 + e]. (These form what might be called a ghost staircase 
because it lies below the graph of c(a).) Thus one cannot understand c(a) 
on this interval by easy estimates using the Diophantine conditions. 

On the other hand, this approach is sufficient to show that there are 
no obstructions on the interval a > 8-^ and that there are finitely many 
obstructions on the interval 7 < a < 8^. Full details of the proof will be 
given in |23j . 

Why there are no constraints for a > 9. 

We saw above that c(a) = sup( dm -) g £ S m ^"»( a ) _ Hence c(a) > y/a only if 
there is (d, m) G £ such that ^ miWi{a) jd > ^fa. But ^/a > 3, and 

rriiWi < mi = 3d — 1 since all Wi < 1. 

So ^iriiWi/d < 3 always. This was Biran's argument in |b. More com- 
plicated versions of this argument also show there are no constraints for 
a>8^. 

The special properties of a = r 4 . 

a = t 4 is the positive root of 3^/a = a + 1. The sharpest constraints come 
from (d, m) where m ~ Aw(a). Since = a and ^2m? ~ d 2 , we need 

A ~ dj ' \fa. But also 

Wi = 1 + a — 1/q 1 + a, \wi w rm « 3d 

i 

implies 3d A(l + a) or 3-y/a 1 + a. 

2.1. Relation to lattice packing and embedded contact homology. 

The embedded contact homology theory of Hutchings and Taubes |18] is 
not yet fully developed. But, according to Hutchings, the index calcula- 
tions in this theory should provide a series of obstructions to symplectically 
embedding E(a, 1) into B(fi) of the following nature. 

Let a > 1 be irrational. For each pair of integers A,B > 0, consider the 
triangle 

T% B := {(x,y) G R 2 : x,y > 0,x + ay < A + aB). 
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We define 

where d is the smallest positive integer such that 

#{t%,b nz 2 ) < |(d + i)(d + 2) + fc-i, 

where /c is the number of integer points on the slant edge of T| B . Further, 
set 

cech{o) := sup{k A ,B(a)} . 

A,B 

Since cech(o) is nondecreasing, we may extend it to rational a by defining 

cech{o) = sup cech(z). 
z<a,z irrat 

Claim II: cech(cl) < c(a) /or a// a. 

As mentioned above, Claim II is not yet fully proven, but it is expected to 
hold. Further, it seems probable that these two functions actually coincide. 
As evidence for this, we have the following results that were explained to 
me by Hutchings. 

Lemma 2.8. Suppose that a is rational, define T = T^ B as above and 
suppose that 

#{TDZ 2 ) < \{d+l){d + 2) + k-l = l{d 2 + 3d) + k, 

where k > 1 is the number of integral points on the slant edge of T. As- 
sume that (A, B) (resp. (A',B')) is the integral point on the slant edge with 
smallest (resp. largest) x-coordinate. Then there is e > such that 

cech(z) > ifze (a-e,a), c EC h(z) > if z G (a,a + e). 

Proof. To prove the statement for z < a it suffices by continuity to consider 
irrational z of the form z = a — e. Then, for small enough e > 0, the triangle 
TJ B contains k — 1 fewer integral points than T. Therefore kA,B(z) > A+ ^ B , 
which proves the first statement. Similarly, the second statement holds 
because if z = a + e is irrational and e > is sufficiently small, the triangle 
T|, B , contains k—1 fewer integral points than T. □ 

Lemma 2.9. For all n > 1, CECH(b n ) > y/a n +i- 

Proof. Consider the triangle T n C M? with vertices (0,0), (g n+ 2,0) and 
(0,g n ), where g n is the nth odd Fibonacci number. Because g n ,9n+2 are 
mutually prime and satisfy the identities 

9n + 9n+2 = 3g n +l, g n g n +2 = 9n+l + 1> 

we find that 

#(T n nZ 2 ) = \(g n + l)(g n+2 + l) + l 

= M +1 + 3<? n+1 ) + 2. 
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Therefore, because b n = ^f-, we can apply Lemma 
for some e > we have 



2.8 



with k = 2. Hence 



cech(z) > when z £ (b n — e, b n ], 

and 

cech(z) > when z e [b n , b n + e). 
In particular, c EC h{K) = f^f = V a ™+1- D 

The arguments that prove Claim II should also prove that cech has the 
same scaling property as c, namely the function ° ECH ^ a ' should be nonin- 
creasing. If so, one can conclude that cech{o) = c(a) for a G [l,r ]. To see 
this, observe first that Lemma |2 . 9| would imply that the two functions agree 
at a = b n , and second that c(a) is the smallest nondecreasing function with 
the scaling property that takes the given values at b n . 

Numerical evidence suggests that cech (a) = c(a) for all a. It seems very 
likely that, once c(a) is fully calculated, one could verify this by finding 



suitable triangles as in Lemma 2.9 



To my knowledge, rather little is known about lattice counting functions 
such as cech- Here is one very simple result. 

Lemma 2.10. CEcii{ a ) ^ fo r all a > 1. 

Proof. This holds because when s is large the number of integral points in 
T s a is estimated by its area \s 2 ja with error O(s). Therefore k s fl — ► y/a as 
s — > oo. Since A + Ba = s in this case, the result follows. □ 

In [13] , Hardy and Littlewoocj^] consider rather different asymptotic ques- 
tions about the number of lattice points in T\ B , looking at asymptotical 
behavior for fixed a rather than comparing the triangles for a given a with 
those for a = 1 as we do. Interestingly enough the golden ratio r also plays 
a prominent role in their work. 

Remark 2.11. Given positive numbers a, b with a < b, denote by N(a,b) 
the list (with repetitions) of all nonzero numbers of the form ma + nb, 
where m, n are nonnegative integers, put in nondecreasing order. Thus 
JV(1,1) = (1,1,2,2,2,3,...). Write 

N(a,b) 4 N(c,d) 

if for all k > 1 the k entry of iV(a, b) is no greater than the kth entry of 
N(c,d). Then it is easy to check that 

cech (a) = inf{^ : N(a, 1) ^ N(/2,fi)}. 
Thus, if CEcn( a ) = c(a), it would follow that 

£(a,l)AB( M ,/i) N(a,l) 4 AW)- 

^ I am indebted to Peter Sarnak for this reference. 
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As pointed out by HoferQit is an easy step from here to wonder whether 

E(a,l)^E(c,d) N(a,l) 4 N(c,d). 

Note that the embedding results of Guth [TT] and Hind-Kerman [15] im- 
ply that the analogous result in dimensions > 4 does not hold. However, 
embedded contact homology also does not exist in dimensions > 4. The 
question of what the obstructions actually are in higher dimensions is very 
interesting. 

2.2. Ideas behind the proofs. Finally, we describe the basic ideas behind 



the proofs of the two key Propositions 2.2 and 2.4 

I: Embedding balls and blowing up: 

To blow up a point in complex geometry you remove the point and replace 
it by the set of all (complex) lines through that point. In two complex 
dimensions, this set is a complex line E = CP 1 called the exceptional 
divisor. Its normal bundle tt : L — > E has Chern class —1. In this context 
the basic fact is that L^E is biholomorphic to C 2 \{0}. 

On the other hand, in symplectic geometry the picture of blowing up is 
somewhat different. Darboux's theorem shows that a neighborhood of a 
point p in (M 4 , o>) can be identified with a neighborhood of {0} in M 4 = C 2 . 
Therefore one can blow up as before (with respect to a compatible complex 
structure) to get a manifold M with blow down map bl : M — > M. But one 
must put a symplectic form on M, and the pullback form bl*{oj) vanishes on 
E. So the symplectic structure to a on M must have the form bl*(u))+a / K*{aE) 
near E, where J E <je = 1 and vr : nbhd(i£) — > E. In this context the basic 
fact is the following: 

((nbhdE 1 \ E),u a ) is symplectomorphic to C 2 \B(a). 

This means that to blow up symplectically with weight a one removes an 
embedded copy of the open ball intS(a) and then collapses dB(a) = S 3 to 
S 2 = E by the Hopf map (along the leaves of the characteristic foliation). 



Proposition 2.4 is proved by using this description of symplectic blowing 
up to convert the ball packing problem into a question about the existence 
of symplectic forms on the fc-fold blow up of CP 2 . The latter problem is 
solved using the theory of J-holomorphic curves as in |22| [1] . 

II: Cutting E(a, 1) into balls via toric models 



We now describe the basic ideas behind the proof of Proposition 2.2 
If a = p/q the image of E(a, 1) under the moment map : C' 
I 2 , (zi, Z2) i— > (|zi| 2 , | -2^2 1 2 ) is the triangle 

{0<xi,x 2 ; qxi +px 2 < p}. 



at a problem session at the Introductory Workshop to the Symplectic Topology 
program, August 21, 2009 at MSRI, Berkeley. 
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As illustrated on the left of Figure |2.3[ this corresponds to a singular variety, 



with singular points of orders p, q. (In contrast, the triangles in Figure 1.2 
are standard, i.e. equivalent under an integral affine transformation to a 
triangle with vertices (0, 0), (a, 0), (0, a), and so represent the smooth variety 
CP 2 .) We can cut this singular triangle into standard triangles of different 
sizes. 



(3,5) 






1 




3 










i 



Figure 2.3. Cutting triangles into standard triangles as 
compared to cutting a rectangle into squares. 



Figure 2.3 also shows that the triangle decomposition on the left is struc- 
turally the same as that giving the weights for 5/3. One needs to remove 
the top right corner of the rectangle and then collapse its top and right side. 
As we explain in more detail in [21] , it also corresponds to a joint resolution 
of the two singularities of the toric variety corresponding to the complement 
of the triangle in the positive quadrant. 





Figure 2.4. The lines on the right hand figure are the cuts 
needed to resolve the singularities of the complement of the 
triangle in the left hand figure. 



Figure 2.4 illustrates another key point. The cuts needed to resolve the 
two singularities in the shaded polytope (the result of removing the singular 
triangle) are illustrated on the right. Note that they are parallel to the cuts 
that decompose the singular triangle on the left of Figure [273] into standard 
triangles. (These cuts resolve the singularities because the matrices whose 
rows are adjacent normal vectors all have determinant ±1.) 

Remark 2.12. The information contained in the weight expansion of a can 
be summarized in a diagram called the Riemenschneider staircase. This 
usually arises in the context of resolving singularities so that the interesting 
numbers are the coefficients of the Hirzebruch-Jung continued fraction ex- 
pansions of the two related fractions p/q and p/(p — q) (where we assume 



16 



DUSA MCDUFF 



p > 2g); cf. Fulton [10]. However one can also construct the diagram from 
the multiplicities m, ■ ■ . , ns of the entries in the weight expansion w(a). We 
illustrate this by the two examples in Figure 2.5 See Popescu-Pampu 
for an extended discussion of these combinatorics; and Craw-Reid [B] for 
an example of the use of this diagram in current algebraic geometry. Also 
compare [2U Remark 3.12(h)]. 



3 l . 2 . 2 
2 i m 



(I) • 



4 2 # s . 5 # 5 o 

2 2 . 



3 2 V 3 : i f 
< 



2 4 2 2 




(II) 



FIGURE 2.5. (I) illustrates a = p/q = 5/2 = [2; 2] while (II) 
illustrates a = p/q = 17/5 = [3; 2, 2]. The labels Wi(a) are 
the smaller numbers by the dots; the vertical and horizontal 
numbers give the components of the Hirzebruch-Jung con- 
tinued fractions for p/q > 2 and p/(p — q) < 2. The arrows 
give the direction in which one should read the numbers. 



Observe first that if p, q are relatively prime with p > q then exactly one 
of the numbers p/q,p/(p — q) is greater than two. Thus, without loss of 
generality, we may assume that p/q > 2 so that no > 2. Given positive 
integers no, . . . , define 

| = [n ;ni,n 2 ,...,n fc ] = n ' 



m + 



It is not hard to see that the entries of the weight expansion w(|) 
(wo, . . . , wn) have multipicities no, . . . , n^. Moreover qwi E Z for all i 

(Wo,---,^v) := 



other words, the vector W(-) 



qw(-) has the form 



Xq, • • • , Xo, X\, . . . , Xi, . . . , Xfc, . . . , Xfc , 



no 



In 



where Xi E Z and = 1. We shall call these renormalized weights W% = 
qwi the labels of a = |. 

The staircase is constructed from the upper right, proceeding to the left 
and down, by first placing no horizontal dots, then n\ vertical dots (starting 
in the row of the first dots), then n-2 horizontal dots (starting in the row 
of the second dots), and so on. The first dot is white, and the others are 
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black. One can recover the labels from the staircase by starting with 1 at 
the bottom left and moving back up the staircase according to the rule: 

at a horizontal move, the new label is the sum of the labels in the 
preceding column; at a vertical move, the new label is the sum of the 
labels in the preceding row. 

The coefficients of the Hirzebruch-Jung continued fraction for p/q are one 
more than the number of black dots in each row (read downwards). If one 
moves the white dot, putting it in a new row instead of a new column, 
and constructs the labels as described above, then one gets the labels for 
the associated fraction p/(p — q). Correspondingly the coefficients of the 
Hirzebruch-Jung continued fraction for p/{p — q) are one more than the 
num ber of blacks dots in each column (read from right to left). Thus Figure 
(I) gives W(§) = (2,2,1,1) and 



2.5 



2 ~~ z ' 2 ~ ° 2' 3 ~~ 1 ' 1 + 1 ~~ z 3' 



while Figure 2.5 (II) gives W(^) = (5, 5, 5, 2, 2, 1, 1) and 
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